3 research outputs found

    Application of Principal Component Analysis to advancing digital phenotyping of plant disease in the context of limited memory for training data storage

    Get PDF
    Despite its widespread employment as a highly efficient dimensionality reduction technique, limited research has been carried out on the advantage of Principal Component Analysis (PCA)–based compression/reconstruction of image data to machine learning-based image classification performance and storage space optimization. To address this limitation, we designed a study in which we compared the performances of two Convolutional Neural Network-Random Forest Algorithm (CNN-RF) guava leaf image classification models developed using training data from a number of original guava leaf images contained in a predefined amount of storage space (on the one hand), and a number of PCA compressed/reconstructed guava leaf images contained in the same amount of storage space (on the other hand), on the basis of four criteria – Accuracy, F1-Score, Phi Coefficient and the Fowlkes–Mallows index. Our approach achieved a 1:100 image compression ratio (99.00% image compression) which was comparatively much better than previous results achieved using other algorithms like arithmetic coding (1:1.50), wavelet transform (90.00% image compression), and a combination of three transform-based techniques – Discrete Fourier (DFT), Discrete Wavelet (DWT) and Discrete Cosine (DCT) (1:22.50). From a subjective visual quality perspective, the PCA compressed/reconstructed guava leaf images presented almost no loss of image detail. Finally, the CNN-RF model developed using PCA compressed/reconstructed guava leaf images outperformed the CNN-RF model developed using original guava leaf images by 0.10% accuracy increase, 0.10 F1-Score increase, 0.18 Phi Coefficient increase and 0.09 Fowlkes–Mallows increase

    Improvement of plant disease classification accuracy with generative model-synthesized training datasets

    Get PDF
    Digitalization in agriculture requires critical research into applications of artificial intelligence to various specialization domains. This work aimed at investigating the application of image synthesis technology to the mitigation of the data volume constraint to digital plant disease phenotyping accuracy. We designed an experiment involving the use of a deep convolutional generative adversarial network (DC-GAN) to synthesize photorealistic data for healthy and bacterial spot disease-infected tomato leaves. The training dataset contained 1,272 instances per class. We further employed a 3-block visual geometry group (VGG) convolutional neural network (CNN) model with dropout regularization and 1 epoch to compare classification accuracies of the original dataset and various synthetic datasets. Our results showed that the third DC-GAN synthesized training dataset containing 3,816 synthetic examples of both healthy and bacterial spot infected tomato leaf classes outperformed the original training dataset containing 1,272 real examples of both tomato leaf classes (77.088% accuracy with the former dataset on a 3-block VGG CNN model with dropout regularization and 1 epoch, as compared to 76.447% accuracy with the latter dataset on the same classifier)

    Integrating genetic markers and adiabatic quantum machine learning to improve disease resistance-based marker assisted plant selection

    Get PDF
    The goal of this research was to create a more accurate and efficient method for selecting plants with disease resistance using a combination of genetic markers and advanced machine learning algorithms. A multi-disciplinary approach incorporating genomic data, machine learning algorithms and high-performance computing was employed. First, genetic markers highly associated with disease resistance were identified using next-generation sequencing data and statistical analysis. Then, an adiabatic quantum machine learning algorithm was developed to integrate these markers into a single predictor of disease susceptibility. The results demonstrate that the integrative use of genetic markers and adiabatic quantum machine learning significantly improved the accuracy and efficiency of disease resistance-based marker-assisted plant selection. By leveraging the power of adiabatic quantum computing and genetic markers, more effective and efficient strategies for disease resistance-based marker-assisted plant selection can be developed
    corecore